Add diffusion model implementation #408

vpratz · 2025-04-13T13:10:04Z

This PR adds a diffusion model implementation for use as an inference network, as discussed in #403. It implements the design introduced as "EDM" in [1]. The overall structure is taken from the FlowMatching class.

@arrjon @niels-leif-bracher I would appreciate it if you could take a look and suggest how we can incorporate the other diffusion model variants as well. For now, I chose to expose only the sigma_data parameter to the end user and keep everything else private. This should allow us to change the internals later on and add new functionality incrementally.

Please let me know how we want to proceed and how much capacity you have to move this forward, so that we can decide whether we want to include the additional options before we merge, or if we merge early and then incrementally add to it later. I have situated the class in the experimental module for now, so that we have some freedom to also change things in the future as we see fit.

[1] https://arxiv.org/abs/2206.00364

Preliminary implementation, to be extended with other variants as well.

codecov · 2025-04-13T13:21:15Z

Codecov Report

Attention: Patch coverage is 60.29777% with 160 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
bayesflow/experimental/diffusion_model.py	64.85%	129 Missing ⚠️
bayesflow/utils/integrate.py	8.82%	31 Missing ⚠️

Files with missing lines	Coverage Δ
bayesflow/experimental/__init__.py	`100.00% <100.00%> (ø)`
bayesflow/utils/__init__.py	`100.00% <100.00%> (ø)`
bayesflow/utils/integrate.py	`44.06% <8.82%> (-8.71%)`	⬇️
bayesflow/experimental/diffusion_model.py	`64.85% <64.85%> (ø)`

arrjon · 2025-04-14T12:56:58Z

Thanks @vpratz for the implementation! sigma_data is already specific to the EDM version. I think we have to make it even more general, with additional arguments that can be passed depending on the type of schedule one wants to use.

I plan to add additional schedules and samplers on top of this by the end of the week.

vpratz · 2025-04-14T13:33:32Z

Thanks for taking a look. Do you know whether your implementation would benefit from the pre-conditioning discussed in Elucidating the Design Space of Diffusion-Based Generative Models, and whether we can combine them in one joint framework?
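For reference, the pre-conditioning in question can be written down in a few lines. The following is a minimal sketch of the coefficients from Table 1 of [1], not the code in this PR: the function name `edm_preconditioning` and the dictionary layout are illustrative assumptions, and `sigma_data=0.5` is the paper's default rather than necessarily the one used here.

```python
import math


def edm_preconditioning(sigma: float, sigma_data: float = 0.5) -> dict:
    """EDM pre-conditioning coefficients and loss weight (Karras et al., 2022, Table 1).

    Hypothetical helper for illustration only; not part of the BayesFlow API.
    """
    total = sigma**2 + sigma_data**2
    return {
        # weight of the skip connection: D(x) = c_skip * x + c_out * F(c_in * x, c_noise)
        "c_skip": sigma_data**2 / total,
        # scaling of the raw network output
        "c_out": sigma * sigma_data / math.sqrt(total),
        # scaling of the network input, keeps its variance at one
        "c_in": 1.0 / math.sqrt(total),
        # noise-level embedding passed to the network
        "c_noise": 0.25 * math.log(sigma),
        # loss weight lambda(sigma) = 1 / c_out(sigma)^2
        "loss_weight": total / (sigma * sigma_data) ** 2,
    }


if __name__ == "__main__":
    for sigma in (0.01, 0.5, 80.0):
        c = edm_preconditioning(sigma)
        effective_weight = c["loss_weight"] * c["c_out"] ** 2
        print(f"sigma={sigma:6.2f}  c_skip={c['c_skip']:.4f}  effective weight={effective_weight:.4f}")
```

Because the loss weight is the inverse of the squared output scaling, the effective weight on the raw network output is one at every noise level, which is one way to see why part of the pre-conditioning can be treated as a choice of weighting function.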

arrjon · 2025-04-14T14:01:15Z

Part of the pre-conditioning can be expressed as a special kind of weighting function: see appendix D.1 in here.

So yes, the aim would be to have one nice framework!

arrjon · 2025-04-16T09:32:49Z

I added some more noise schedules and started to make the implementation more general. This is just a first draft, so that you, @vpratz, get an idea of how we could do it. We should then discuss this and how to move forward.

arrjon · 2025-04-23T20:20:15Z

I added a class NoiseSchedule and different schedules, so it should be easy now to extend to more schedules if necessary. Since EDM has a specific sampling scheme for inference, this is now also defined in the noise schedule. Therefore, we do not have to specify specific sampling step sizes anymore.
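To illustrate what it means for a schedule to carry its own sampling scheme, here is a minimal sketch of the EDM sampling discretization from [1]. It is not the NoiseSchedule class from this PR: the function names are hypothetical, and the defaults (`sigma_min=0.002`, `sigma_max=80`, `rho=7`) are the values reported in the paper, assumed here for illustration.

```python
import math


def edm_sampling_sigmas(
    num_steps: int,
    sigma_min: float = 0.002,
    sigma_max: float = 80.0,
    rho: float = 7.0,
) -> list:
    """Noise levels for EDM sampling, from sigma_max down to sigma_min.

    Hypothetical helper: interpolates linearly in sigma^(1/rho), which places
    more steps at low noise levels than a linear grid in sigma would.
    """
    if num_steps < 2:
        raise ValueError("num_steps must be at least 2")
    high = sigma_max ** (1.0 / rho)
    low = sigma_min ** (1.0 / rho)
    return [
        (high + i / (num_steps - 1) * (low - high)) ** rho
        for i in range(num_steps)
    ]


def log_snr_variance_exploding(sigma: float) -> float:
    """Log signal-to-noise ratio when the signal is not scaled (alpha = 1)."""
    return -2.0 * math.log(sigma)


if __name__ == "__main__":
    for sigma in edm_sampling_sigmas(5):
        print(f"sigma={sigma:9.4f}  log_snr={log_snr_variance_exploding(sigma):8.3f}")
```

With the step locations defined by the schedule itself, a sampler only needs the number of steps, which is the simplification described above.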

The next step would be to add stochastic samplers as well.

vpratz · 2025-04-29T09:13:34Z

Thanks a lot for the fixes, they increase the performance of the new implementation a lot. The old standalone EDM implementation seems to be a little bit better still, but the difference might be down to hyperparameter tuning. I have added the examples/experimental/Two_Moons_Diffusion_Comparison.ipynb notebook, which allows testing both implementations on the same benchmark, and plotting the results against each other. This is for two moons, but feel free to expand it with other benchmarks as well.

As far as I can tell, the open steps before we finalize this PR are:

- optimising performance: ensuring that we achieve the same performance as with the standalone EDM implementation, so that we do not forego performance by not including it
  - related: find and set good defaults
- remove the LinearNoiseSchedule (as discussed privately, maybe moving it to a tutorial): the other schedules perform better, so we do not need to include and maintain it here
- add tests to cover all relevant cases/combinations
- proof-reading docstrings and maybe supplying a tutorial
- remove the examples/experimental/Two_Moons_Diffusion_Comparison.ipynb notebook before merging

Did I miss anything, @arrjon, or do you have any other comments on the current state?

arrjon · 2025-04-29T13:55:28Z

The performance issue is fixed now. It was mainly due to a missing scaling factor for the log_snr that goes into the network.

I am also implementing the stochastic sampler: it is working for all backends except jax at the moment. After this, only the things @vpratz mentioned are missing.

arrjon · 2025-04-29T14:21:37Z

The stochastic sampler is now also working for jax. So all features done for the moment!
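As a rough illustration of what a stochastic sampler does, here is a minimal Euler-Maruyama sketch for the reverse-time SDE. It is not the sampler from this PR. It assumes a toy setting in which the data are N(0, data_std^2) and the noise level grows as sigma(t) = sigma_max * t, so the exact score is known in closed form and no network is needed. All names and parameters are hypothetical.

```python
import math
import random
import statistics


def sample_reverse_sde(
    num_samples: int = 2000,
    num_steps: int = 200,
    data_std: float = 1.0,
    sigma_max: float = 10.0,
    seed: int = 0,
) -> list:
    """Euler-Maruyama integration of the reverse SDE from t = 1 down to t = 0.

    Toy setting (assumption): forward process dx = g(t) dW with
    sigma(t) = sigma_max * t, hence g(t)^2 = d sigma^2 / dt = 2 * sigma_max^2 * t.
    """
    rng = random.Random(seed)
    h = 1.0 / num_steps
    # Base distribution at t = 1: approximately N(0, sigma_max^2).
    xs = [rng.gauss(0.0, sigma_max) for _ in range(num_samples)]
    for k in range(num_steps):
        t = 1.0 - k * h
        g_squared = 2.0 * sigma_max**2 * t
        marginal_var = data_std**2 + (sigma_max * t) ** 2
        noise_scale = math.sqrt(g_squared * h)
        # Exact score of the Gaussian marginal: -x / marginal_var.
        # Going backward in time, the step is x + g^2 * score * h + g * sqrt(h) * z.
        xs = [
            x + g_squared * (-x / marginal_var) * h + noise_scale * rng.gauss(0.0, 1.0)
            for x in xs
        ]
    return xs


if __name__ == "__main__":
    samples = sample_reverse_sde()
    print(f"sample std: {statistics.pstdev(samples):.3f} (target: 1.0)")
```

Note that the drift uses the squared diffusion coefficient while the injected noise uses its square root times sqrt(h), and that the integration runs from t = 1 to t = 0. The explicit seed makes the draws reproducible.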

vpratz · 2025-04-29T15:45:58Z

Great! Thanks a lot for putting in the work and for the quick fixes! I'll try to add the relevant tests and work on some of the other missing things in the next few days.

Add diffusion model implementation, EDM variant

549a055

Preliminary implementation, to be extended with other variants as well.

vpratz requested review from niels-leif-bracher and arrjon April 13, 2025 13:10

vpratz added the feature New feature or request label Apr 13, 2025

arrjon self-assigned this Apr 14, 2025

adding more noise schedules

630a823

Base automatically changed from dev to main April 22, 2025 14:37

adding noise scheduler class

c1cb183

arrjon and others added 2 commits April 23, 2025 22:23

adding noise scheduler class

49c0cb7

Merge branch 'main' into feat-diffusion-model

5f11724

vpratz changed the base branch from main to dev April 24, 2025 07:17

vpratz and others added 13 commits April 24, 2025 09:19

Merge branch 'dev' into feat-diffusion-model

280b651

fix backend

e840046

fix backend

f2d7de4

wip: adapt network to layer paradigm

d5dc2ba

improve schedules

739491a

Merge branch 'feat-diffusion-model-adapt' into feat-diffusion-model

efeff85

add serialization, remove unnecessary tensor conversions

92131d7

format inference network conftest.py

bd564b5

add dtypes and type casts in compute_metrics

0f7b3f5

disable clip on x by default

2ce74f0

fixes: use squared g, correct typo in _min_t

01b33dc

integration should be from 1 to 0

6031212

add missing seed_generator param

d82e2bf

arrjon and others added 22 commits April 25, 2025 09:58

stochastic sampler fix

548f51b

fix scale base dist

194a503

EDM training bounds

196683c

minor changes

5b52499

fix base distribution

eb96620

seed in stochastic sampler

668f6fc

seed in stochastic sampler

1a970c2

seed in stochastic sampler

ebafc5e

seed in stochastic sampler

9941fa3

seed in stochastic sampler

afaebef

seed in stochastic sampler

c1558c5

fix is_symbolic_tensor

1efd88f

[skip ci] skip step_fn for tracing (dangerous, subject to removal)

7456cdb

seed in stochastic sampler

a722729

seed in stochastic sampler

ee0c87b

fix loss

f2cbde6

fix loss

7b7b15a

improve schedules

1811038

improve schedules

9d13264

improve edm

4e0b7f8

temporary: add notebook to compare implementations

a028e8a

Merge remote-tracking branch 'upstream/dev' into feat-diffusion-model

1f15b7d

arrjon added 3 commits April 29, 2025 13:42

add loss types

6794342

add loss types

7c527a5

scale snr

5ca609f

fix stochastic sampler

79be9ab
